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Abstract. I review here the latest measurements related to vector boson production in 
association with light or heavy flavor jets at the Tevatron and at the LHC. The methods to 
estimate the W+jets and Z+jets background in top-quark analyses are also presented. 



1. Introduction 

Vector boson production in association with jets (l^-|-jets) is an important process. First, it 
allows to test perturbative quantum chromodynamics (QCD) in a stringent way. It is also used 
to check Monte Carlo (MC) simulation and stands as one of the favoured processes to tune this 
simulation. These processes are particularly interesting because they allow precision QCD tests 
with multiple renormalisation or factorisation scales set by the vector boson mass and by the 
associated light or heavy-flavor jet transverse momentum. Precise measurements of the V^-l-jets 
properties are therefore of primary importance. 

In top-quark physics, F-l-jets production is one of the main backgrounds in the lepton-l-jets 
(^+jets) and dilepton (ii) channels. In both cases, vector boson production in association with 
heavy quarks (mainly 6-quarks) leads to the same final state as the production of a top-antitop 
quark pair (ti). From a theoretical point of view, predictions exist for V+ < 3 jets at the 
Tevatron and V+ < 4 jets at the LHC. 

Different jet reconstruction algorithms are used at the Tevatron and the LHC. At the 
Tevatron, the midpoint cone algorithm is used with a cone radius of AR = ^/Arf~+~A^ = 0.4 
for CDF and AR = 0.5 for DO. The anti-kt algorithm is used at the LHC with AR = 0.4 for 
ATLAS [Ij and AR = 0.5 for CMS [5] which also exploits the technique of particle flow for 
jet reconstruction. The LHC results presented in this article are based on 7 TeV data. These 
reconstructed jets are calibrated for the electromagnetic and hadronic scales. For the MC/data 
distributions shown in the following, the measurements are corrected at the particle level within 
the acceptance cuts. This means that the results are valid for specific ranges of lepton and jets 
kinematics. 

2. F-|-jets experimental measurements 

2.1. W+jets experimental measurements 

At the Tevatron the VF-f jets measurements rely on jets with px > 20 GeV and < 2 for CDF 
or \y\ < 3.2 for DO. The measured differential cross sections are normalized to the total inclusive 
W production cross section so that cancellations of some systematic uncertainties occur. In 



some cases, the normalization is rather performed using the next-to-next-leading order (NNLO) 
inclusive W prediction. The correction for reconstruction effects is completed by bin-by-bin 
unfolding in CDF or by regularized matrix inversion in DO. The relative uncertainty on the 
results range from around 4% Vl^ + 1 jet to 20% for 14^ + 3 jets. In general, comparisons 
with normalized multileg generators or next-to-leading order (NLO) simulations show a good 
agreement. The predictions and measurements for the normalized differential cross section of 
W + eu > n jets in CDF [3] and DO [4j are shown in Figure [!} 
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Figure 1. Measured W +n jet differential cross section at DO for jet multiplicities n = 1 — 4 [3] 
(left) and jet multiplicity measurement normalized to the total inclusive W ^ ev cross section 
at CDF taj (right). 



At the LHC, jets with pr > 20 or 30 GeV and \y\ < 4.4 for ATLAS or \r]\ < 2.4 for CMS 
are selected. The normalization for the differential cross section measurements is performed in 
the same way as at the Tevatron. The correction for reconstruction effects is based on iterative 
Bayesian unfolding in ATLAS or singular value decomposition (SVD) in CMS. The uncertainties 
for the measurements of the W production vary from 8% for the + 1 jet production to 20% for 

+ 3 jets [5l|6]. At the LHC also, the comparison between multileg or NLO generators shows 
good agreement. Figure [2] shows the MC/data comparison for some differential cross section 
measurements in ATLAS and CMS. 

At the Tevatron, DO and CDF also measure the inclusive W + c jets cross sections. In 
addition, CDF studies the W + 1,2b jets cross sections. W + c jets production is interesting 
since it is sensitive to the s quark content of the proton and the process is calculable in 
multi-scale perturbative QCD. MC predicts that the process W + c represents around 5% of 
the + 1 jet production (when the jet has px > 10 GeV). Soft lepton tagging is used to 
identify the W + c production. With 4.3 fb~^ of data, CDF measures a{W + c) • Br{W — >• 
£1/) = 13.3^2 9 (stat + syst) pb for jets with px > 20 GeV and \ri\ < 1.5 in agreement 

with the NLO prediction of 11.3 ± 2.2 pb. With 1 fb"^ of data, DO measures = 



7.4±1.9(stat)li:^(syst)% [7\ in agreement with the prediction from Alpgen+Pythia simulation 
of 4.4 lb 0.3%. With 1.9 fb^^ of data, CDF also measures the cross section with at least one 
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tight 6-tagged jet using a template fit of the vertex mass to get a(W + b/bb) ■ Br{W — )• £z^) = 
2.74 lb 0.27(stat) ib 0.42(syst) pb [8j that appears to be higher by around 3a than the NLO 
prediction of 1.22 ib 0.14(syst) pb. The LHC experiments also study the VF+heavy flavor jet 
production. Using events with one 6-tag and a template fit to the vertex mass distribution in 
35 pb-\ ATLAS measures a{W + b/bb) ■ Br{W Iv) = 10.2 ib 1.9(stat) ib 2.6(syst) pb within 
the fiducial cuts |9| which is l.Scj higher than the NLO prediction of 4.8 ib 1.0 pb. Using events 
with one tag and a template fit of a secondary vertex discriminant in 36 pb^^, CMS measures 
ulw+jfts) = 14-3 ± l-5(stat) lb 2.4(syst)% using jets with pT > 20 GeV and |??| < 2.1 [10] in 
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Figure 2. W+jets cross section as a function of the px of the first jet in the event in ATLAS [5] 
(left) and the ratio (t{W + n jets)/a{W) in the electron channel compared to the expectations 
in CMS [6J (right). 



agreement with the NLO prediction of 13 it 2%. 
2.2. Z+jets experimental measurements 

Studies of Z+jets production at the Tevatron use jets with pT > 20 or 30 GeV and \r]\ < 2.1 
in CDF and \y\ < 2.5 at DO. The uncertainties of the differential measurements typically 
range from 7 to 10% (for Z + 2 jets). Measurements [11] are compared to leading order (LO), 
multileg and NLO generators. A good agreement is reached with multileg and NLO generators. 
The measured differential cross sections as a function of jet multiplicity or jet momentum are 
generally more precise than MC predictions. 

Measurements are also performed at the LHC using 36 pb^^ of data using jets with 
Pt > 30 GeV and \y\ < 4.4 for ATLAS and \ri\ < 2.4 for CMS. For reconstruction corrections, 
ATLAS uses a bin- by-bin unfolding while CMS uses SVD. CMS also tests the so-called Berends- 
Giele scaling by measuring ^ ^ ^ +n+ 1 j ets ) — o + nj3 showing that /3 is compatible with 0. The 
uncertainties range between 13 to 20%. A good agreement is observed between data |12l [6] and 
multileg and NLO generators. 

At the Tevatron, CDF measures the Z + b jets cross section using jets reconstructed with the 
midpoint cone algorithm with R = 0.7 and pT > 20 GeV and \r]\ < 1.5 while the leptons are 
selected using a neural network. Using 9.1 fb^^ of data, the cross section is extracted using a 
template fit of the secondary vertex mass leading to: '^j^^z-t/etlf ~ 2.08±0.18(stat) ±0.27(syst)% 
in agreement with the predictions from MCFM and corresponding to a K-factor of 1.6 compared 
to the Alpgen prediction. At DO, jets with pT > 15 or 20 GeV and |r/| < 2.5 are used. A 
template fit of a discriminant variable built with the secondary vertex mass and the probability 
from tracks to originate from the primary vertex is used. With 4.2 fb~^ of data, DO measures 
'"ifz+/etff = 1-93 ± 0.22(stat) ± 0.15(syst)% [l3] in agreement with the prediction from MCFM 
of 1.92 ±0.22%. 

At the LHC, the Z+h jets cross section is measured with jets with pT > 25 GeV and 
\y\ < 2.1 with a template fit of the secondary vertex mass yielding 3.55]to 74(stat)^Q55(syst) it 



0.12(lumi) pb [14j in agreement with the predictions from MCFM of 3.88±0.58 pb. With 2.2 fb"^ 
of data, CMS measures independently the Z + lh and Z + 2h jets cross sections using jets with 
Pt > 25 GeV and \y\ < 2.1 and a template fit to the secondary vertex mass of the leading jet. 
The results give a{Z + b) = 5.84 ± 0.08(stat) ± 0.72(syst)+°-5^ (theory) pb [15j compatible with 
the prediction from MCFM, and a{Z + bb) = 0.37±0.02(stat) ±0.07(syst) ±0.02(theory) pb [l6] 
in agreement with the predictions from Madgraph. 

2.3. Summary of the V+jets measurements 

Even if a lot of y+jets measurements have been performed at the Tevatron and the LHC, only a 
few of them have been listed above. The y+light jets measurements agree with predictions from 
NLO and multileg generators (scaled to the NNLO inclusive prediction) . The measurements are 
already enabling to constrain the MC predictions since their precisions are around 20% for 
+ 3 jets and 15% for Z + 2 jets. In general, measurements of y+heavy flavor jets agree with 
predictions from NLO and scaled multileg generators. However, some discrepancy exists in the 
measurement of the W + b jets cross section at CDF while the LHC measurements so far agree 
with the predictions for the same process. The measurement uncertainties are generally larger 
than the ones on MC predictions. Typically, the precisions obtained in data are around 25% 
ioi W + c jets, 20-30% iov W + b jets and around 15-30% for Z + 6 jets. Further measurements 
with higher statistics are expected, both, from Tevatron and LHC in the near future. LHC 
measurements with ^/s = 8 TeV are also expected. 

3. y-|-jets background determination in top-quark analyses 

Depending on the experiments, on the colliders and on the analyses, different techniques are used 
to evaluate the background from F+jets in top-quark measurements. Only some of them will be 
discussed below which are based on the methods used for tt and single top-quark cross section 
measurements. The general rule (which suffers from exception) is that the T^+jet background 
shapes in top-quark analyses are usually taken from MC (either from Alpgen or Madgraph) while 
the y+jets normalization is usually measured directly on data. The ratio of y+heavy flavor 
(hf) jets over l^+light jets is rescaled using NLO predictions and/or checked or constrained 
using data itself. When both light and hf jet MC samples are produced separately, there is some 
overlap of hf jets in the light flavor jet samples because hf jets can originate from matrix element 
(ME) or parton shower (PS). A procedure to remove the overlap needs to be established. A 
possible removal can be based on jet angles, i.e. for a given heavy flavor jet, the jet from PS is 
kept if there is a jet j with AR{j, hf) < 0.4 otherwise the jet from ME is kept. 

3.1. W+jets background determination 

The VF+jets background is of particular importance in the lepton+jets channel. 

At CDF, Alpgen+Pythia is used to model the shapes of the M^+jets background. In the tt 
lepton+jets cross section measurements, the M^+jets background is normalized to data before b- 
tagging and after subtraction of the other backgrounds. The ti cross section was measured using 
a fit to the distribution of a neural network based flavor separator in 9 samples (1— > 5 jets, 
1 or 2 b-tag) |17| . The background is constrained with the 1 or 2 jets sample with floating 
normalization for the bb, cc and single c MC samples. Fitting the normalization allows to reduce 
the VF-jets related-systematic uncertainties. The measured K-factor related to Alpgen+Pythia 
is KjjI = l-39]'^g'22 in agreement with the theoretical prediction. In CDF single top cross section 
measurements, the VF+n jets pretag data sample is used to scale the prediction from Alpgen. 
The flavor separator is fitted in the 1 jet sample to get Khf = 1.4 ± 0.4. A 30% uncertainty is 
assessed for the extrapolation to the VF+2,3 jets samples. 

DO is also using Alpgen+Pythia for modelling the VF+jets background. In the tt lepton+jets 
cross section measurement, the normalization is fitted after subtraction of the other background 



together with the tt cross section in bins of number of jets. After checking on data, the 
NLO predictions for the ratio Rhj = N{W + hb/cc)/N{W + light) = 1.47 ± 0.22 and Rc = 
N [W + c) / N [W + light) = 1.27ib0.15 are used which translates into an uncertainty of 0.2 pb on 
the measured tt cross section [18j . R^f can also be fitted together with the it cross section using 
samples with 1 or 2 jets, with or without 6-tagging leading to Rhf = 1.55±0.09(stat)lo;]^g(syst). 
The uncertainty on Rhf translates into an uncertainty of 0.02 pb in the fitted it cross section. 
In the single top quark cross section analysis, the VF+jets background is normalized with the 
multijet background before 6-tagging using the lepton pT, the missing transverse energy {^t) 
and the W transverse mass distributions. The VF+hf scale factor is taken from the NLO 
predictions and cross-checked with the 0-tag 2 jet sample. A 12% uncertainty is assigned to 
this normalization. Figure [s] shows the CDF and DO distributions on control samples in the tt 
cross section analyses. 
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Figure 3. Output of the flavor separator after the flt split by the number of jets and tags in 
CDF (left), jet multiplicity distribution for events with 6-tagged jets at DO [18j (right). 



In ATLAS, Alpgen+Herwig is used to generate the VF+jets MC samples. In the it lepton+jets 
cross section analysis, the T^+jets normalization is fitted together with the tt cross section. 
Alternatively, the ratio of the number of produced and W~ is used. At the LHC, due to 
the quark content of the protons, more are produced than W~ in T^+jets processes. The 
ratio rMC = /W~ is well known theoretically. Hence the total number of VK+jets before 
tagging (in the 4 jet inclusive bin) can be extracted from the value of rMC in the MC with the 
formula: Ny^pretag = f^j^z^(-D'^ — D~) where D^,D~ is the number of events with or 
in data after selection. The hf scale factor is determined in the 1, 2 jet sample without 
and with 6-tagging. In single top cross section measurements, the VF+jets sample is scaled by 
a factor K=1.2 which corresponds to the inclusive NNLO predictions from FEWZ. The VF+hf 
normalization is fitted together with the signal. 

CMS is using Madgraph+Pythia to generate the VF+jets MC samples. In their untagged 
it cross section analysis, a template fit of discriminating variables (like the distribution of ^t) 
allows to extract the VF+jets normalization and the tt cross section. In the tagged analysis, a 
template fit of the secondary vertex mass is performed to get the VF+jets normalization and 
the hf scale factor. The extracted hf scale K^j is found to be slightly higher than the NNLO 
predictions: Kw+b jets = l-S^oJ and Kw+c jets = 1-4 ± 0.2. For the single top quark cross 
section measurements, a template fit of the non-tagged jet pseudo-rapidities is exploited in 
sidebands using the 2 jets, 1 6-tag sample. Figure |4] shows typical distributions for the VF+jets, 
tt combined fit at the LHC. 

3.2. Z+jets background determination 

The Z+jets background is mostly relevant for the analyses in the dilepton channel. As for the 
VK+jets samples, the Tevatron experiments are using Alpgen+Pythia to generate the Z+jets MC 
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Figure 4. Result of combined fit to data in tlie 3 to 5-jet bin in ATLAS [I9j (left), Results of 
the combined fit in the muon channel for exactly 1 6-tag at CMS [20j (right). 



background while ATLAS is using Alpgen+Herwig and CMS Madgraph+Pythia. Depending 
on the collaborations, the number of Z+light jets or Z+M jets is taken directly from the 
predictions of the multileg generators scaled to the NNLO computations, or it is evaluated 
using a data-driven method. The data-driven method scales the data in the Z mass window 
after selection. The MC is used afterwards to predict the ratio of events inside and outside 
the Z mass region. The data driven extraction agrees with the MC predictions but usually leads 
to smaller uncertainties (1-3% uncertainties on the tt cross section). 

4. Conclusion 

A lot of y-|-jets measurements have been performed by the CDF, DO, ATLAS and CMS 
experiments at the Tevatron and the LHC. The measurements agree with the NLO MC 
predictions. Multilegs generators are suitable to reproduce most of the F-l-jets kinematics. 
Still, more results and checks are expected to come. Various methods are used to measure 
the y-|-jets background in top-quark analyses. Depending on the needed precision, multileg 
generators are scaled to NLO predictions or data driven normalization are used or fit together 
with the top-quark signal. Thanks to the advanced background estimation technique, the 
achieved uncertainty on the y-|-jets background is usually found to be small. 
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